Docket MIPS: 0109. OOUS 



ATOMIC UPDATE OF CPO STATE 
by 

G. Michael Uhler 



Assignee: MIPS Technologies, Inc. 

1225 Charleston Road 
Mountain View, CA 94043-1353 



Address correspondence to: 

James W. Huffman 

1832 N. Cascade Ave. 

Colorado Springs, CO 8 0907 

719.475.7103 

719.623 .0141 

j im@huf f manlaw . net 



Docket MIPS: 0109. OOUS 

ATOMIC UPDATE OF CPO STATE 

by 

G. Michael Uhler 

CROSS REFERENCE TO RELATED APPLICATION ( S ) 

[0001] This application is related to U.S. Patent 

application Serial No. (MIPS : 0110 . OOUS) 

entitled "READ ONLY ACCESS TO CPO REGISTER" assigned to 
MIPS Technologies, Inc. 

FIELD OF THE INVENTION 

[0002] This invention relates in general to the field of 
computer architecture, and more specifically to a method 
and apparatus for atomically updating privileged 
architecture registers within a microprocessor. 

BACKGROUND OF THE INDENTION 

[0003] A microprocessor is a digital device that is 
utilized to execute instructions defined within a computer 
program. Such instructions include adding two data values 
together, comparing data values, moving data from one 
location to another, etc. In addition, a microprocessor is 
designed to communicate with other devices, such as network 
controllers, disk drives, keyboards, displays, etc., that 
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are external to it. Typically, such external devices 
request the attention of the microprocessor by signaling 
one of several interrupt lines on the microprocessor. When 
this occurs, the microprocessor halts program execution, 
runs a special program called an "exception handler" to 
determine which device requested the interrupt, and what 
the nature of the interrupt is. Once the microprocessor 
has determined the nature of the interrupt, it executes an 
interrupt routine that is specific to the nature of the 
interrupt. Upon completion, the microprocessor typically 
returns to the program it was executing before the 
interrupt occurred. 

[0004] One skilled in the art will appreciate that the 
above description of interrupt handling is very simplistic. 
That is, it has not considered the difference between 
hardware interrupts, software interrupts, internal 
exceptions, etc., nor has it considered what occurs when 
multiple interrupts occur simultaneously, or when a second 
interrupt occurs during processing of a first interrupt. 
However, the way a microprocessor handles an interrupt is, 
in many ways, dependent on the specific architecture of the 
microprocessor. For purposes of illustrating the present 
invention, an interrupt problem will be described with 



3 



Docket MIPS : 0109 . O0US 

particular reference to the architecture of MIPS32™ and 
MIPS64™ microprocessors (hereinafter MIPS @ microprocessors) 
designed by MIPS Technologies, Inc. of Mountain View, 
California. 

[0005] More specifically, like many modern 

microprocessors, MIPS microprocessors contain privileged 
control registers that are updated during events such as 
interrupts and excepbions. These control registers are not 
directly accessible by user programs that execute on the 
microprocessors. Rather, when an interrupt or exception 
occurs, program execution is halted, and a special program, 
called an exception handler, is run which accesses the 
control registers, both to determine the nature of the 
interrupt, and to set or clear bits within one or more of 
the control registers, which control how the interrupt will 
be handled. (Note: The exception handler typically 
resides within an area of memory that can only be accessed 
by the microprocessor if it is in "kernel" mode. That is, 
to prevent user programs from accessing or changing the 
contents of the privileged control registers, a task switch 
must first occur which authorizes the execution of the 
exception handler, and thus access to the control 
registers.) Once the exception handler has read the 
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appropriate control registers, and established how the 
interrupt will be handled, an interrupt service routine is 
used to service the interrupt. Upon completion of 

servicing the interrupt, the exception handler returns 
execution to the user program. 

[0006] A particular control register within a MIPS 
microprocessor is called the Status Register (SR) . The SR 
contains a number of bit fields that may be examined, and 
set or cleared, during handling of an interrupt. Like most 
control registers, the SR register cannot be modified 
directly. Rather, the process for examining and modifying 
the contents of the SR register involves the steps of: 1) 
Reading the contents of the SR register into one of the 
general purpose registers within the microprocessor; 2) 
Modifying, or setting/clearing particular bits within the 
general purpose register that contains the contents of the 
SR register; and 3) Writing the modified contents of the 
general purpose register back into the SR register. This 
three- step process is referred to as a R-M-W sequence, and 
is often required during interrupt processing. 

[0007] Sample program code that performs this R-M-W 

sequence is shown below: 
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Label 



Instruction 
mfcO 

or 
and 

mtcO 



Operands 
t0 f SR 

to, things_to_set 

tO , things_to__preserve 

tO,SR 



[0008] The first instruction "mfcO" causes the 
microprocessor to move the contents of the SR register into 
a general purpose register designated tO. This is the read 
cycle. The next two instructions "or" and "and" set or 
preserve particular bits within the tO register, 
corresponding to the "things_to_set" mask, or the 
"things_to_preserve" mask. This is the modify cycle. The 
fourth instruction v mtc0" causes the microprocessor to move 
the contents of the tO register back into the SR register. 
This is the write cycle. Specifics relating to which of 
the SR bits are modified during an interrupt, and how the 
above instructions operate, may be found in the MIPS32 or 
MIPS64 architecture specifications, which are available 
from MIPS Technologies, Inc., and incorporated herein by 
reference for all purposes. 

[0009] During execution of an interrupt service routine, 
it may be desirable to perform the above described R-M-W 
sequence. One problem that exists is that the R-M-W 
sequence may itself be interrupted. For example, suppose 
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that the R-M-W sequence described above is executing in 
response to a first interrupt. Further, suppose a second 
interrupt occurs between label 1 and label 2 above . The 
processor might halt execution of this R-M-W sequence and 
begin servicing the second interrupt. Now, suppose that 
while servicing the second interrupt, the contents of the 
SR register are changed. Upon return from servicing the 
second interrupt, the code at label 2 will write its 
altered value into SR, but will lose any changes made to 
the SR register by the second interrupt routine. Such a 
situation is unacceptable since changes to the SR register 
cannot be ignored. 

[0010] In response to the above problem, several 
solutions have been developed to prevent changes in the SR 
register from being lost or ignored. A first solution is 
to simply disable all interrupts prior to performing the R- 
M-W sequence above. However, this situation is a catch-22, 
because in order to disable interrupts, interrupt mask bits 
in the SR register must be cleared, the clearing of which 
requires the same R-M-W sequence! What happens if an 
interrupt occurs while executing the R-M-W sequence that 
disables interrupts? One method of solving this problem is 
to insist that no interrupt change the value of SR during 
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execution of any interrupt code. That is, by requiring 
that all interrupt code begin by saving off the value of 
SR, and end by restoring the saved value of SR back into 
SR, it can be assured that the SR value doesn't change. In 
many environments, however, this is considered too 
restrictive, as well as too time consuming. Moreover, in 
operating environments where interrupt routines are not 
strictly nested, you can never be sure you return to the 
spot where the proper SR value can be restored. 

[0011] A second solution is to use a system call to 
disable interrupts. A system call works by causing an 
exception that disables interrupts in a single cycle. 
Then, the R-M-W sequence can proceed without risk of 
interruption. Many programmers consider use of a system 
call, to simply overcome the inherent problems of updating 
the SR register, overly dramatic, and again, too time 
consuming . 

[0012] Therefore, what is needed is a mechanism that 
allows a programmer to set or clear bits within a control 
register of a processor, while guaranteeing that the 
modification process is not interrupted. 

[0013] Furthermore, what is needed is a method and 
apparatus that sets or clears selected portions of a 
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privileged control register atomically (i.e., within a 
single non- interrupt ible instruction) . By modifying 

control bits atomically, the above presented problems 
associated with the R-M-W sequence are solved without 
imposing any additional processing delays, and without 
insisting that interrupt routines be strictly nested. 

SUMMARY 

[0014] The present invention provides a method and 
apparatus for atomically modifying privileged architecture 
control registers when a microprocessor is in a privileged 
state . 

[0015] In one aspect, the present invention provides a 
microprocessor with a control register that is modifiable 
by a privileged (kernel) instruction, where the control 
register has bit fields. The microprocessor includes a 
core, to receive the privileged instruction and to modify 
the control register upon execution of the privileged 
instruction. The privileged instruction includes: an 

opcode and first and second operands. The opcode 

identifies the instruction as a privileged instruction. 
The first operand specifies the control register as a 
register to be modified. The second operand specifies a 
location of a second register within the microprocessor, 
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the second register containing a bit mask, the bit mask 
determining which of the bit fields within the control 
register are to be modified. The bit mask is used to set 
or clear the bit fields in the control register atomically. 

[0016] In another aspect, the present invention provides 
a method for atomically modifying interrupt mask bits 
within a privileged control register of a microprocessor. 
The method includes: providing a privileged instruction 
which instructs the microprocessor to clear particular ones 
of the interrupt mask bits; providing a bit mask, for 
specifying which of the particular ones of the interrupt 
mask bits are to be cleared; and upon receipt of an 
interrupt by the microprocessor, atomically clearing the 
particular ones of the interrupt mask bits as specified by 
the bit mask. 

[0017] In yet another aspect, the present invention 
provides a bit clear instruction to execute on a 
microprocessor that has a privileged control register, 
where the bit clear instruction executes on the 
microprocessor when the microprocessor receives an 
interrupt and when the microprocessor is in a privileged 
state. The bit clear instruction includes a plurality of 
opcode bits to designate the bit clear instruction as a 
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privileged instruction; a plurality of first operand bits 
to designate the privileged control register as the 
register to be operated upon by the bit clear instruction; 
and a plurality of second operand bits to designate a 
general purpose register as a register containing a bit 
mask, where the bit mask specifies which of a plurality of 
bits within the privileged control register are to be 
cleared. When the bit clear instruction executes on the 
microprocessor, the plurality of bits within the privileged 
control register that are specified by the bit mask are 
cleared, atomically . 

[0018] In a further aspect, the present invention 
provides a computer program product for use with a 
computing device. The computer program product includes a 
computer usable medium, having computer readable program 
code embodied in the medium, to cause a microprocessor 
having a control register that is modifiable by a 
privileged instruction to be described. The computer 
readable program code includes first program code to 
provide a core, to receive the privileged instruction and 
to modify the control register upon execution of the 
privileged instruction. The computer readable program code 
also includes second program code to provide the privileged 



11 



Docket MIPS: 0109. OOUS 

instruction, the privileged instruction including an opcode 
and two operands. The opcode identifies the instruction as 
a privileged instruction. The first operand specifies the 
control register as a register to be modified. The second 
operand specifies a location of a second register within 
the microprocessor, the second register containing a bit 
mask, the bit mask determining which of the bit fields 
within the control register are to be modified, where the 
bit mask is used to set or clear the bit fields in the 
control register. The bit fields in the control register 
are modified atomically by the privileged instruction. 

[0019] In yet another aspect, the present invention 
provides a computer data signal embodied in a transmission 
medium. The data signal includes computer- readable first 
program code to provide a bit clear instruction for 
execution on a microprocessor having a privileged control 
register, the bit clear instruction executing on the 
microprocessor when the microprocessor receives an 
interrupt and when the microprocessor is in a privileged 
state. The first program code includes: a) program code to 
provide a plurality of opcode bits for designating the bit 
clear instruction as a privileged instruction; b) program 
code for providing a plurality of first operand bits for 
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designating the privileged control register as the register 
to be operated upon by the bit clear instruction; and c) 
program code for providing a plurality of second operand 
bits for designating a general purpose register as a 
register containing a bit mask, the bit mask specifying 
which of a plurality of bits within the privileged control 
register are to be cleared. When the bit clear instruction 
executes on the microprocessor, the plurality of bits 
within the privileged control register that are specified 
by the bit mask are cleared, atomically. 

[0020] Other features and advantages of the present 
invention will become apparent upon study of the remaining 
portions of the specification and drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0021] FIGURE 1 is a block diagram of a microprocessor 
coupled to coprocessors according to the present invention. 

[0022] FIGURE 2 is a block diagram illustrating the bits 
of a Coprocessor 0 status register (SR) . 

[0023] FIGURE 3 is a flow chart illustrating a read- 
modify-write (R-M-W) sequence required for updating the 
status register in Coprocessor 0. 
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[0024] FIGURE 4 is a flow chart illustrating one 
methodology for dealing with an interrupt during the SR 
update process provided in Figure 3 . 

[0025] FIGURE 5 is a flow chart illustrating a second 
methodology for dealing with an interrupt during the SR 
update process provided in Figure 3 . 

[0026] FIGURE 6 is a table illustrating four 
instructions for atomically updating the SR of Coprocessor 
0. 

[0027] FIGURE 7 is a block diagram illustrating the bit 
architecture for the four atomic update instructions of 
Figure 6 . 

[002 8] FIGURE 8 is a flow chart illustrating the impact 
of an interrupt receiving during an atomic update of the SR 
using the atomic update instructions illustrated in Figures 
6 and 7 . 

DETAILED DESCRIPTION 

[002 9] Referring to Figure 1, a block diagram 10 0 is 
shown illustrating a microprocessor (or CPU) 102 coupled to 
a floating point Coprocessor 112 and a second Coprocessor 
114 via an interface bus 116. In one embodiment, the 
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second Coprocessor 114 may be a graphics Coprocessor, an 
audio Coprocessor, or any other Coprocessor capable of 
executing instructions not designed specifically for the 
CPU 102. The CPU 102 includes an execution core 104, 
coupled to a general purpose register file (GPR) 106 for 
receiving instructions from memory (or a cache) and for 
execution of CPU 102 instructions. When floating point 
instructions are presented to the core 104, it provides 
those instructions to the floating point coprocessor 112 
for execution. When instructions are presented to the core 
104 that are designed for the second Coprocessor 114, the 
CPU 102 provides those instructions to the second 
Coprocessor 114. While the present invention is not 
restricted to any specific CPU, for purposes of discussion, 
the invention will be described with reference to the MIPS 
32 and 64 bit processor architectures, available from MIPS 
Technologies, Inc. of Mountain View, California. 

[003 0] In addition to its normal computational 
functions, the CPU 102 includes a control unit to handle 
interrupts 120, configuration options e.g., such as the 
endianness of data, and observing or controlling on-chip 
functions like caches and timers. The control unit within 
the CPU 102 for performing such tasks is called Coprocessor 
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0 108, or CPO. CPO 108 is coupled to control registers 110 
which contain values that set how the CPU 102 is to 
operate. For a listing of the control registers 110 within 
the MIPS 32/64 bit architecture, reference is made to Table 

1 below. A complete description of each of these registers 
may be found in the MIPS32 and MIPS64 architecture 
Specifications, which are available from MIPS Technologies, 
Inc., and which are incorporated herein by reference for 
all purposes . 

[0031] CPO provides an abstraction of the functions 
necessary to support an operating system: exception 
handling, memory management, scheduling, and control of 
critical resources. The interface to CPO is through 
various instructions encoded with a COP0 opcode, including 
the ability to move data to and from the control registers 
110. The control registers 110 and the interaction with 
them make up what is called the Privileged Resource 
Architecture . 
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Table 1 



Register 
Number 


Sel 


Register Name 


Function 


0 


0 


Index 


Index into the TLB array 


1 


0 


Random 


Randomly generated index into the 
TLB array 


2 


0 


EntryLoO 


Low-order portion of the TLB entry for 
even-numbered virtual pages 


3 


0 


EntryLol 


Low-order portion of the TLB entry for 
odd-numbered virtual pages 


4 


0 


Context 


Pointer to page table entry in memory 


5 


0 


PageMask 


Control for variable page size in TLB 
entries 


6 


0 


Wired 


Controls the number of fixed ("wired") 
TLB entries 


7 


All 




Reserved for future extensions 


8 


0 


BadVAddr 


Reports the address for the most 
recent address-related exception 


9 


0 


Count 


Processor cycle count 


10 


0 


EntryHi 


High-order portion of the TLB entry 


11 


0 


Compare 


Timer interrupt control 


12 


0 


Status 


Processor status and control 


13 


0 


Cause 


Cause of last general exception 


14 


0 


EEPC 


Program counter at last exception 


15 


0 


PRId 


Processor identification and revision 


16 


0 


Config 


Configuration register 


16 


1 


Configl 


Configuration register 1 


17 


0 


LLAddr 


Load linked address 


18 


0-n 


WatchLo 


Watch point address 
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Register 

Niimhpr 


Sel 


Register Name 


Function 


19 


0-n 


WatchHi 


Watchpoint control 


20 


0 


XContext 


XContext in 64-bit implementations 


21 


All 




Reserved for future extensions 


11 


All 

All 




MVdllctUlfci 1UI 111 IjJIcll lt?l llcHtUI I 

dependent use 


23 


0 


Debug 


EJTAG Debug register 


24 


0 


DEPC 


Program counter at last EJTAG 
debuci exceDtion 


25 


0-n 


PerfCnt 


Performance counter interface 


26 


0 


ErrCtl 


Parity/ECC error control and status 


27 


0-3 


CacheErr 


oacne parity error coniroi dnu oidiuo 


28 


0 


TagLo 


Low-order portion of cache tag 
mien ace 


28 


1 


DataLo 


Low-order portion of cache data 
interlace 




U 


i agm 


Minh_nrHor nnrtinn nf rsichp tan 
niyll UfUci pui LiUN ui uciui ic lay 

interface 


29 


1 


DataHi 


High-order portion of cache data 
interface 


30 


0 


ErrorEPC 


Program counter at last error 


31 


0 


DESAVE 


EJTAG debug exception save register 



[0032] Of particular interest to the present invention 
is register number 12, called the Status Register (SR) , 
because it is the Status Register, among others, that is 
referenced when servicing interrupts or exceptions. 
Referring to Figure 2 a block diagram 200 is shown 
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illustrating the bit fields within a Status Register (SR) 
that is located within the control registers 110 of CPO 
108. For a description of each of the bit fields within 
the SR, reference is made to Table 2 below. 



Table 2 



Fields 


Description 


Read/ 
Write 


Name 

CU 


Bits 

31:28 


Controls access to coprocessors 3, 2, 1, 
and 0, respectively: 

0: access not allowed 
1: access allowed 

Coprocessor 0 is always usable when 
the processor is running in Kernel Mode 
or Debug Mode, independent of the 
state of the CU 0 bit. 

If there is no provision for connecting a 
coprocessor, the corresponding CU bit 
must be ignored on write and read as 
zero. 


R/W 


RP 


27 


Enables reduced power mode on some 
implementations. The specific 
operation of this bit is implementation 
dependent. 

If this bit is not implemented, it must be 
ignored on write and read as zero. If 
this bit is implemented, the reset state 
must be zero so that the processor 
starts at fuli performance. 


R/W 


FR 


26 


Controls the floating point register mode 
on 64-bit MIPS processors. Not used 
by MIPS32 processors. This bit must 
be ignored on write and read as zero. 


R 


RE 


25 


Used to enable reverse-endian memory 

references while the processor is 

running in user mode: 

0: User mode uses configured 

endianness 


R/W 
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Fields 


Description 


Read/ 
Write 


Name 


Bits 






1: User mode uses reversed 
endianness 

Neither Kernel Mode nor Supervisor 
Mode references are affected by the 
state of this bit. 

If thic hit i<5 nnt imnipmpntfid it must be 

ignored on write and read as zero. 




MX 


24 


Enables access to MDMX* resources 
on MIPS64* processors. Not used by 

MIPQ*39* nrnrfiQQnr^ Thte hit must be 

ignored on write and read as zero. 


R 


PX 


23 


Enables access to 64-bit operations on 
MIPS64 processors. Not used by 

r"v rr\ noccnrc Thic hit miiQt 

Mir oOt procesoorb. i mo uu muoi 
ignored on write and read as zero. 


R 


BEV 


22 


Controls the location of exception 
vectors: 
0: Normal 
1: Bootstrap 


R/W 


TS 


21 


Indicates that the TLB has detected a 
match on multiple entries. It is 
implementation dependent whether this 
detection occurs at all, on a write to the 
TLB, or an access to the TLB. When 
such a detection occurs, the processor 
initiates a machine check exception and 
sets this bit. It is implementation 
dependent whether this condition can 
be corrected by software. If the 
condition can be corrected, this bit 
should be cleared before resuming 
normal operation. 

If this bit is not implemented, it musi ue 
ignored on write and read as zero. 

Software writes to this bit may not 
cause a 0-1 transition. Hardware may 
ignore software attempts to cause such 
a transition. 


R/W 


SR 


20 


Indicates that the entry through the 
reset exception vector was due to a 
Soft Reset: 

0: Not Soft Reset (NMI or Reset) 


R/W 
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Fields 


Description 


Read/ 
Write 


Name 


Bits 






1: Soft Reset 

If this bit is not implemented, it must be 
ignored on write and read as zero. 

Software may only write a zero to this 
bit to clear the SR condition. Hardware 
may ignore software attempts to write a 
one to this bit. 




NMI 


19 


Indicates that the entry through the 
reset exception vector was due to an 
NMI 

0: Not NMI (Soft Reset or Reset) 
1: NMI 

If this bit is not implemented, it must be 
ignored on write and read as zero. 

Software may only write a zero to this 
bit to clear the NMI condition. 
Hardware may ignore software attempts 
to write a one to this bit. 


R/W 


0 


18 


Must be written as zero; returns zero on 
read 


0 


Impl 


17:16 


These bits are implementation 
dependent and not defined by the 
, architecture. If they are not 
implemented, they must be ignored on 
write and read as zero. 




IM7- 
IMO 


15:8 


Interrupt Mask: Controls the enabling of 

each of the external, internal and 

software interrupts. 

0: interrupt request disabled 

1 : interrupt request enabled 


R/W 


KX 


7 


Enables access to 64-bit kernel address 
space on 64-bit MIPS processors. Not 
used by MIPS32* processors. This bit 
must be ignored on write and read as 
zero 


R 


SX 


6 


Enables access to 64-bit supervisor 
address space on 64-bit MIPS 
processors. Not used by MIPS32* 
processors. This bit must be ignored on 
write and read as zero. 


R 
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Fields 


Description 


Read/ 
Write 


Name 


Bits 


UX 


5 


Enables access to 64-bit user address 
space on 64-bit MIPS processors. Not 
used by MIPS32* processors. This bit 
must be ignored on write and read as 
zero. 


R 


KSU 


4:3 


If Supervisor Mode is implemented, the 
encoding of this field denotes the base 
operating mode of the processor. The 
encoding of this field is: 

00: Base mode is Kernel Mode 
01 : Base mode is Supervisor Mode 

1fV Race mnrlo iq I iQ^r MdHp 

11: Reserved. The operation of the 
processor is UNDEFINED if this value is 
written to the KSU field 

Note: This field overlaps the UM and 
R0 fields, described below. 


R/W 


ERL 


2 


Error Level; Set by the processor when 
a Reset, Soft Reset, NMI or Cache 
Error exception is taken. 

0: normal level 
1: error level 

When ERL is set: 

- The processor is running in Kernel 
Mode 

- Interrupts are disabled 

- The ERET instruction will use the 
return address held in ErrorEPC instead 
of EPC 

- The lower 2 29 bytes of kuseg are 
tread as an unmapped and uncached 
region. This allows main memory to be 
accessed in the presence of cache 
errors. The operation of the processor 
is UNDEFINED if the ERL bit is set 
while the processor is executing 
instructions from kuseg. 


R/W 


EXL 


1 


Exception Level; Set by the processor 
when any exception other than Reset, 
Soft Reset, NMI or cache Error 
exception is taken. 

0: normal level 


R/W 
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Fields 


Description 


Read/ 
Write 


Name 


Bits 


1 : exception level 
When EXL is set: 

- The processor is running in Kernel 
Mode 

- Interrupts are disabled 

- TLB refill exceptions will use the 
general exception vector instead of the 

Tl R r of ill x/optnr 

- EPC and Cause B D will not be updated 
if another exception is taken 




IE 


0 


Interrupt Enable: Acts as the master 


R/W 






enable for software and hardware 








interrupts: 








0: disable interrupts 








1: enables interrupts 





* Information about MIPS32, MIPS64, MDMX architecture specifications are 
available from MIPS Technologies, Inc of Mountain View, California. 

[0033] When an interrupt occurs during execution of 
either a user program, or in the middle of servicing 
another interrupt, the CPU 102 may be required to modify 
bits within the SR register 200. The logical flow of such 
modification is shown below with reference to Figure 3. 



[0034] In Figure 3, a flow chart 3 00 is provided that 
particularly illustrates program flow within the CPU 102 
upon occurrence of an interrupt, per the scenario discussed 
briefly above in the Background. A user mode program 301 
is shown executing, until an interrupt 3 03 occurs. One 
skilled in the art will appreciate that the interrupt may 
occur from any of a number of sources external to the CPU 
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102. At this point, all instructions that have 

successfully completed execution within the CPU 102 
continue to pass thru remaining pipeline stages (not shown) 
within the CPU 102. However, instructions that have not 
yet completed execution are halted, and will be restarted 
upon return from the interrupt. 

[0035] Within the MIPS architecture, when an interrupt 
occurs, the interrupt will be serviced by the CPU 102 if: 
1) interrupts are enabled (the IE bit is set); 2) the mask 
bit (IMO-7) for the particular interrupt is set; and 3) the 
EXL bit is clear. Otherwise, the CPU 102 will ignore 
interrupts. More specifically, if the CPU 102 is in user 
mode, the EXL bit is clear. When an interrupt occurs, the 
interrupt will cause the EXL bit to be set, thereby 
disabling all further interrupts until the EXL bit is 
cleared. So, if the CPU 102 is in user mode when interrupt 
3 03 occurs, the EXL bit is set by the interrupt, and 
instruction flow proceeds to decision block 302. 

[0036] At decision block 302 a determination is made as 
to whether the interrupt will require very little 
processing, or significant processing (as determined from 
the type of interrupt) . If the interrupt serviced by the 
routine can be completed in just a few instructions, the 
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programmer may allow the EXL bit to remain set, thereby 
disabling further interrupts while servicing the interrupt. 
If the programmer determines that the interrupt will 
require very little processing, and that there is no need 
to re-enable other interrupts, flow proceeds to block 304. 
Otherwise, flow proceeds to block 3 06. 

[0037] At block 304, the interrupt is serviced. Upon 
completion of servicing the interrupt, the CPU 102 executes 
a return from exception instruction to cause the processor 
to jump back to user mode where it left off. In addition, 
the return from exception instruction clears the EXL bit to 
allow interrupts. 

[0038] Back at decision block 302, if the interrupt 
requires more significant time to process, the programmer 
may wish to re- enable other interrupts before proceeding 
with the processing. Flow therefore proceeds to block 3 06. 

[0039] At block 306, the state of the CPU 102 is stored 
to memory. One skilled in the art will appreciate that the 
state of the processor includes the contents of the EPC 
register (i.e., the register holding the contents of the 
program counter prior to jumping to the exception handler) , 
the SR, and possibly other state information. The state of 
the processor is stored prior to enabling interrupts so 
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that if the exception routine is interrupted, and the 
contents of SR and/or EPC get overwritten, they may later 
be restored. Flow then proceeds to block 308. 

[0040] At block 308 interrupts are re-enabled. To 
accomplish this, the programmer clears the EXL bit, and 
modifies the contents of the interrupt mask bits (IMO-7) to 
disable the interrupt bit pertaining to the present 
interrupt, while enabling other interrupts. Clearing and 
modification of bits in the SR register are not done 
directly. Rather, bits in status registers, such as SR, 
are first moved into the GPR 106, then modified, and then 
moved back to the SR. So, at block 308 the contents of the 
SR register are moved to a register within the GPR 106. 
Then, according to the exception handler program, 
particular bits within the SR are set or cleared in the 
copy of SR that was moved to GPR 106. Such bits include 
the interrupt mask bits IMO-7, the EXL bit, and possibly 
the interrupt enable bit (IE) . After modification, the 
contents of the register in GPR 106 that contains the 
modified fields within the SR are moved back into the SR 
register. At this point, updating of the SR register is 
complete, and interrupts are re-enabled. One skilled in 
the art will appreciate at this point that the RMW sequence 
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has not been interrupted because during the modification, 
the EXL bit is still set. Flow then proceeds to block 310. 

[0041] At block 310, the service routine for the 
particular interrupt executes. Upon completion of 

servicing the interrupt, it is necessary to restore the 
state of the CPU 102 to the state it was in before the 
interrupt occurred. However, since restoring the state of 
the processor requi]res a R-M-W sequence that may itself be 
interrupted, interrupts are first disabled. This is shown 
at block 312 . 

[0042] At block 312, the contents of the SR register are 
moved to a register in the GPR 106, the interrupt enable 
(IE) bit is cleared thereby disabling interrupts, and the 
contents of the register in the GPR 106 are moved back to 
the SR. Once the IE bit in the SR has been cleared, the 
state of the CPU 102 may be restored without fear of being 
interrupted. Flow then proceeds to block 314. 

[0043] At block 314, the state of the CPU 102 is 
restored. EPC is restored from memory, SR is restored from 
memory, and any other state information that was stored in 
block 306 is restored. A return from the exception places 
the contents of EPC into the program counter of the CPU 102 
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to cause instruction flow to begin again at the point of 
interruption . 

[0044] The flow chart of Figure 3 is provided to 
particularly illustrate several aspects of interrupt 
handling. The first is that when an interrupt occurs, a 
context switch is made to kernel mode within the CPU 102, 
unless of course the CPU 102 is already executing in kernel 
mode (in which case a switch is made to Kernel mode with 
the EXL bit being set) . That is, interrupt service 
routines, exception handlers, etc., that require access to 
the control registers 110 within the CPU 102, operate in a 
different, i.e., higher privilege level than user programs 

(since user programs do not have access to the control 
registers 110) . 

[0045] In addition, when an interrupt occurs, if the 
service routine requires more than a few instructions to 
execute, the state of the processor must first be stored 
into memory, and interrupts re-enabled. Upon completion of 
servicing the interrupt, the state of the processor must be 
restored. However, since interrupts may still occur, 
interrupts must first be disabled prior to restoring the 
state of the processor. But, to disable interrupts, a R-M- 
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W sequence must occur between the SR and the GPR 106, a 
sequence which itself might be interrupted. 

[0046] Figure 4 provides a flow chart 400 illustrating 
the effect of multiple interrupts while handling the 1 st 
interrupt described above in Figure 3 . Suppose an 
interrupt 4 02 occurs prior to deciding whether the 1 st 
interrupt is going to be time consuming, or while handling 
a short interrupt. At this point, interrupt 4 02 will be 
ignored because the EXL bit remains set, thereby disabling 
all interrupts. Therefore, no problem occurs with respect 
to the state of the CPU 102, or the SR register. 

[0047] Suppose an interrupt 404 occurs while the state 
of the CPU 102 is being saved, or while other interrupts 
are being enabled at block 306. At this point, the EXL bit 
is still set thereby disabling interrupt 404. The state of 
the CPU 102 is preserved, and the instruction flow that 
modifies the SR is preserved. Still there is no problem. 

[0048] Suppose however that an interrupt 406 occurs 

after interrupts have been re-enabled. Recall that to 
insure that the CPU 102 returns to the state that it was in 
prior to handling the 1 st interrupt, its state at the time 
of the 1 st interrupt must be restored. The restoration of 
the state of the SR however is a R-M-W sequence that itself 
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may be interrupted. That is why block 312 disables 
interrupts prior to restoring the state of the CPU 102. 
But, block 312 may Itself be interrupted. If this occurs, 
it is possible that the state of the CPU 102 might not be 
restored to the condition it was in before the interrupt. 
Such an instance can be disastrous. 

[0049] Referring now to Figure 5, a flow chart 500 is 
shown that illustrates a methodology for insuring that the 
R-M-W sequence of block 312 is not prevented from 
accomplishing its purpose completing and proceeding to 
block 314. More specifically, Figure 5 illustrates a 
requirement that all interrupt service routines be strictly 
nested. That is, that all interrupts store the state of 
CP0 prior to proceeding, restore it upon completion, and 
always return to the location they interrupted. 

[0050] Flow starts at block 502 where update of the SR 
begins (such as the disabling of interrupts by clearing the 
IE bit, as shown in block 312) . Flow then proceeds to 
block 504. 

[0051] At block 504, the contents of the SR register are 
moved to the GPR 106. At this point, an interrupt 530 
occurs. Since the interrupt 530 has not been disabled, 
program flow jumps to an exception handler that begins an 
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interrupt service routine, at block 512. Flow then 
proceeds to block 514 . 

[0052] At block 514, to insure that the contents of the 
SR are not lost, at least at the point of the interrupt, 
the state of the CPO 108, including the SR register, is 
saved. Flow then proceeds to block 516. 

[0053] At block 516, the interrupt 530 is serviced by an 
appropriate interrupt service routine. One skilled in the 
art will appreciate that the interrupt service routine may 
change the state of the SR register. Flow then proceeds to 
block 518. 

[0054] At block 518, the state of CPO, including the SR 
register is restored to the condition it was in before 
servicing interrupt 530. Flow then proceeds to block 520. 

[0055] At block 52 0, a return from interrupt is executed 
that places program execution back in the R-M-W sequence, 
at block 506. Instruction flow then proceeds thru blocks 
506, 508, and 510 to complete the update of the SR. 

[0056] At least two aspects of the solution presented in 
Figure 5 should be clear to one skilled in the art. First, 
the instruction flow of Figure 5 insists that no interrupt 
service routine be allowed to change the contents of SR. 
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That is, whatever else happens during servicing of an 
interrupt, at the end of the interrupt, the service routine 
must restore the contents of the SR that existed when the 
service routine began. As mentioned above, this imposition 
is unacceptable in many environments. Second, the 

instruction flow presented in Figure 5 insists that all 
interrupt service routines be strictly nested. That is, 
whenever a departure in normal program flow occurs, as a 
result of an interrupt, the interrupt service routine is 
required, upon completion, to return to the point of 
departure. This requirement is common in some programming 
environments, but in many real-time environments, such as 
network routing, strict nesting of interrupt service 
routines is not desired. Moreover, in some environments, 
substantial program code has already been developed that 
does not enforce strict nesting of interrupts, and 
rewriting of the code to overcome the R-M-W problems 
described above would be too time consuming, as well as too 
costly. 

[0057] For these reasons, the atomic update of CPO state 
methodology of the present invention is desired, and will 
now be described with reference to Figures 6-8. 
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[0058] Referring first to Figure 6, a table 600 is 
provided that illustrates four new instructions that are 
being added to the MIPS32 and MIPS64 architecture. The 
instructions include: BICC0 , BISC0, DBICC0, and DBISC0. 
Each of these instructions, and their operation on the 
contents of the SR, will now be described in detail with 
reference to Figure 7 . 

[0059] Figure 7 provides a block diagram of the 32-bit 
COP0 opcodes for these four new instructions. The bit 
fields for the first instruction BICC0 is shown in block 
diagram 702. 

[0060] Instruction BICCO, called Bit Clear to 
Coprocessor 0 is used to atomically clear bits in a 
coprocessor 0 register corresponding to bits that are set 
in a GPR. The format of the instruction is: 

BICCO rt,rd 

and is described as follows: 

CPR[r0, rd, sel] <-CPR [rO , rd, sel] and not rt 

[0061] The bits set in GPR rt are used to clear the 
corresponding bits in the coprocessor 0 register specified 
in the rd and sel fields. Not all coprocessor 0 registers 
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support the sel field. In those instances, the sel field 
must be zero. 

[0062] The results are undefined if coprocessor 0 does 
not contain a register as specified by rd and sel. If 
access to coprocessor 0 is not enabled, a coprocessor 
unusable exception is signaled. 

[0063] The below operation specification is for the 
general Move To operation with the lop field as a variable. 
The instruction BICC0 has a specific value for lop. The 
specification is: 

if isCoprocessorEnabled (0) then 
dataf-GPR[rt] 

CPR[0,rd,sel] <- CPR [0 , rd, sel] and not [rt] 

else 

Signal Exception (CoprocessorUnusable, 0) 

endif 

[0064] Instruction BISCO, called Bit Set to Coprocessor 

0 is used to atomically set bits in a coprocessor 0 
register corresponding to bits that are set in a GPR. A 
block diagram 7 02 is provided illustrating the bit fields 
in the BISC0 instruction. The format of the instruction 
is : 

BISC0 rt,rd 
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and is described as follows: 

CPR[rO, rd, sel] <-CPR [rO , rd, sel] or rt 

[0065] The bits set in GPR rt are used to set the 
corresponding bits in the coprocessor 0 register specified 
in the rd and sel fields. Not all coprocessor 0 registers 
support the sel field. In those instances, the sel field 
must be zero . 

[0066] The results are undefined if coprocessor 0 does 
not contain a register as specified by rd and sel. If 
access to coprocessor 0 is not enabled, a coprocessor 
unusable exception is signaled. 

[0067] The below operation specification is for the 
general Move To operation with the lop field as a variable. 
The instruction BICC0 has a specific value for lop. The 
specification is: 

if isCoprocessorEnabled(O) then 
data^GPR[rt] 

CPR[0,rd, sel] <- CPR [0 , rd, sel] and not [rt] 

else 

Signal Exception (CoprocessorUnusable, 0) 

endif 

[0068] Instruction DBICC0, called Double Bit Clear to 
Coprocessor 0 is used to atomically clear bits in a 
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coprocessor 0 register corresponding to bits that are set 
in a GPR. A block diagram 704 is provided illustrating the 
bit fields in the DBICCO instruction. The format of the 
instruction is: 

DBICCO rt,rd 
DBICCO rt,rd,sel 

and is described as follows: 

CPR[r0, rd, sel]^CPR[rO, rd, sel] and not rt 

[0069] The bits set in GPR rt are used to clear the 
corresponding bits in the coprocessor 0 register specified 
in the rd and sel fields. Not all coprocessor 0 registers 
support the sel field. In those instances, the sel field 
must be zero. 

[0070] The results are unpredictable if coprocessor 0 
does not contain a register as specified by rd and sel, or 
if the coprocessor 0 register specified by rd and sel is a 
32-bit register. If access to coprocessor 0 is not 
enabled, a coprocessor unusable exception is signaled. If 
access to coprocessor 0 is enabled but access to 64 -bit 
operations is not enabled, a Reserved Instruction Exception 
is signaled. 
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[0071] The below operation specification is for the 
general Doubleword Move To operation with the lop field as 
a variable. The instruction DBICCO has a specific value 
for lop. The specification is: 

if isCoprocessorEnabled (0) then 

if (not Are64bitOperationsEnabled() ) then 
SignalException (Reservedlnstruction) 

endif 

datadoubleword<-GPR [rt] 

CPR[0,rd, sel] CPR [0 , rd, sel] and not datadoubleword 

else 

Signal Exception (CoprocessorUnusable, 0) 

endif 

[0072] Instruction DBISC0, called Double Set Clear to 
Coprocessor 0 is used to atomically set bits in a 
coprocessor 0 register corresponding to bits that are set 
in a GPR. A block diagram 706 is provided illustrating the 
bit fields in the DBISC0 instruction. The format of the 
instruction is: 

DBISC0 rt, rd 
DBISCO rt,rd,sel 

and is described as follows: 

CPR[r0, rd, sel] <-CPR [rO , rd, sel] or rt 

[0073] The bits set in GPR rt are used to set the 
corresponding bits in the coprocessor 0 register specified 
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in the rd and sel fields. Not all coprocessor 0 registers 
support the sel field. In those instances, the sel field 
must be zero. 

[0074] The results are unpredictable if coprocessor 0 
does not contain a register as specified by rd and sel, or 
if the coprocessor 0 register specified by rd and sel is a 
32-bit register. If access to coprocessor 0 is not 
enabled, a coprocessor unusable exception is signaled. If 
access to coprocessor 0 is enabled but access to 64 -bit 
operations is not enabled, a Reserved Instruction Exception 
is signaled. 

[0075] The below operation specification is for the 
general Doubleword Move To operation with the lop field as 
a variable. The instruction DBISC0 has a specific value 
for lop. The specification is: 

if IsCoprocessorEnabled(O) then 

if (not Are64bitOperationsEnabled() ) then 
SignalException (Reservedlnst ruction) 

endif 

datadoubleword^-GPR [rt] 

CPR[0,rd,sel] <- CPR [0 , rd, sel] and not datadoubleword 

else 

Signal Exception (CoprocessorUnusable , 0) 

endif 

[0076] The above discussion of Figure 7, describes the 
operation of each of the CP0 atomic update instructions of 
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the present invention. To better comprehend the 

application of these instructions, within the context of an 
interrupt, reference is now directed to Figure 8. 

[0077] Figure 8 provides a flow chart 800 that 
illustrates the benefit of the atomic update instructions 
to modify the state of the SR register, particularly when 
disabling interrupts by clearing the IE bit. During 
execution of a user program 801, an interrupt 803 occurs. 
Instruction flow jumps to an exception handler responsible 
for handling the interrupt 803. Instruction flow proceeds 
similarly to that described above with reference to Figure 
3. However, at block 812, one of the atomic update 
instructions described above with reference to Figures 6 
and 7 is used to update the contents of SR, without 
requiring the R-M-W sequence described above. For example, 
an atomic update instruction could be used to atomically 
clear the IE bit in the SR register, thereby atomically 
disabling interrupts. Thus, when the interrupt 811 occurs 
during execution of block 812, it is not serviced because 
interrupts will have already been disabled by the atomic 
instruction . 

[0078] More generally, depending on whether an exception 
handler wishes to set or clear bits in the SR, and whether 
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the CPU 102 is operating in 32-bit or 64-bit mode, the 
exception handler utilizes one of the four instructions 
shown in Figure 6. If the programmer of the exception 
handler wishes to set bits in the SR, s/he will use either 
the BISCO or DBSCO instruction. If the programmer wishes 
to clear bits in the SR, s/he will use either the BICCO or 
DBICCO instruction. The particular bits that will be set 
or cleared will be based on a bit mask stored in one of the 
registers in the GPR 106. 

[0079] One skilled in the art will appreciate that since 
selected bit fields in the SR register, or any other of the 
control registers 110, are set atomically (i.e., within a 
single cycle) , there is no requirement that interrupt 
service routines be strictly nested. Nor is there any 
requirement that interrupts be disabled during modification 
of any of the control registers, such as when restoring the 
state of the CPU 102. Thus, the present invention imposes 
none of the previcus limitations relating to interrupt 
handling (e.g., disabling of interrupts, strict nesting of 
interrupt service routines, etc.), while enhancing a kernel 
programs ability to modify selected control registers 110 
without a R-M-W sequence. Furthermore, when modification 
of a control register 110 is desired, it may be 
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accomplished more efficiently since such modification 
requires only a single processing cycle. 

[0080] Although the present invention and its objects, 
features, and advantages have been described in detail, 
other embodiments are encompassed by the invention. In 
addition to implementations of the invention using 
hardware, the invention can be embodied in software 
disposed, for example, in a computer usable (e.g., 
readable) medium configured to store the software (i.e., a 
computer readable program code) . The program code causes 
the enablement of the functions or fabrication, or both, of 
the invention disclosed herein. For example, this can be 
accomplished through the use of general programming 
languages (e.g., C, C++, etc.), hardware description 
languages (HDL) including Verilog HDL, VHDL, AHDL (Altera 
Hardware Description Language) and so on, or other 
programming and/or circuit (i.e., schematic) capture tools 
available in the art. The program code can be disposed in 
any known computer usable medium including semiconductor 
memory, magnetic disk, optical disc (e.g., CD-ROM, DVD-ROM, 
etc.) and as a computer data signal embodied in a computer 
usable (e.g., readable) transmission medium (e.g., carrier 
wave or any other medium including digital, optical or 
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analog-based medium) . As such, the code can be transmitted 
over communication networks including the Internet and 
intranets. It is understood that the functions 

accomplished and/or structure provided by the invention as 
described above can be represented in a core (e.g. a 
microprocessor core) that is embodied in program code and 
may be transformed to hardware as part of the production of 
integrated circuits, Also, the invention may be embodied 
as a combination of hardware and software. 

[0081] Also, although the coprocessor interface has been 
described with particular reference to MIPS32 and MIPS64 
Instruction Set Architecture, one skilled in the art will 
appreciate that the applicability of the atomic update of 
privileged resources is not limited to such architecture. 

[0082] Finally, those skilled in the art should 
appreciate that they can readily use the disclosed 
conception and specific embodiments as a basis for 
designing or modifying other structures for carrying out 
the same purposes of the present invention without 
departing from the spirit and scope of the invention as 
defined by the appended claims. 

I claim: 
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